Parameters for Speaker Identification

Authors

  • Nick J.-C. Wang
  • Wei-Ho Tsai
  • Lin-Shan Lee
Abstract

Eigen-MLLR coefficients are proposed as new feature parameters for speaker identification in this paper. By performing principal component analysis on MLLR parameters among training speakers, the eigen-MLLR coefficients (EMCs) are derived as the coefficients for the eigenvectors. The discriminating function of the new EMC features based on the Fisher criterion is found to be ten times larger than that of mel-frequency cepstral coefficient (MFCC) features, for distinguishing speakers. The speaker-identification accuracy using the EMC features is shown to be significantly better than that using MFCC features, especially when the quantity of enrollment data is limited. It is also shown that properly combining MFCC and EMC features can achieve a significant error rate reduction on the order of 50%-60% as compared to using MFCC features alone.
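The derivation described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`fit_eigen_mllr`, `eigen_mllr_coefficients`, `fisher_ratio`), the flattening of each speaker's MLLR transform into a single row vector, the toy dimensions, and the random data are all assumptions made for illustration. The Fisher-style ratio shown is one common per-dimension form (between-speaker variance over within-speaker variance) and may differ from the discriminating function used in the paper.

```python
import numpy as np

def fit_eigen_mllr(mllr_params, n_components):
    """PCA over the training speakers' MLLR parameters.

    mllr_params: (n_speakers, d) array; each row is one speaker's MLLR
    transform flattened into a vector (an assumed layout).
    Returns the mean vector and the top n_components eigenvectors as rows.
    """
    mllr_params = np.asarray(mllr_params, dtype=float)
    mean = mllr_params.mean(axis=0)
    centered = mllr_params - mean
    # The right singular vectors of the centered data are the eigenvectors
    # of its covariance matrix, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]

def eigen_mllr_coefficients(mllr_vector, mean, eigvecs):
    """Project one flattened MLLR transform onto the eigenvectors."""
    return eigvecs @ (np.asarray(mllr_vector, dtype=float) - mean)

def fisher_ratio(features, labels):
    """Per-dimension between-speaker / within-speaker variance (assumed form)."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    groups = [features[labels == s] for s in np.unique(labels)]
    between = np.var([g.mean(axis=0) for g in groups], axis=0)
    within = np.mean([g.var(axis=0) for g in groups], axis=0)
    return between / within

# Toy data: 20 hypothetical training speakers, 12 MLLR parameters each.
rng = np.random.default_rng(0)
train = rng.normal(size=(20, 12))
mean, eigvecs = fit_eigen_mllr(train, n_components=5)
emc = eigen_mllr_coefficients(train[0], mean, eigvecs)
print(emc.shape)  # (5,)
```

In this sketch, the EMC vector of a new speaker would be obtained by estimating that speaker's MLLR transform from enrollment data, flattening it the same way, and passing it to `eigen_mllr_coefficients`.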

Similar resources

Selective use of the speech spectrum and a VQGMM method for speaker identification

This paper describes two separate sets of speaker identification experiments. In the first set of experiments, the speech spectrum is selectively used for speaker identification. The results show that the higher portion of the speech spectrum contains more reliable idiosyncratic information on speakers than does the lower portion of equal bandwidth. In the second set of experiments, a vector-quanti...

Speech compression with preservation of speaker identity

Although much effort has been directed recently towards speech compression at rates below 4 kb/s, the primary metric for comparison has, understandably, been the amount of spectral distortion in the decompressed speech. However, an aspect which is becoming important in some applications is the ability to identify the original speaker from the coded speech algorithmically. We investigate here the...

Bispectrum features for robust speaker identification

Along with the spoken message, speech contains information about the identity of the speaker. Thus, the goal of speaker identification is to develop features which are unique to each speaker. This paper explores a new feature for speech and shows how it can be used for robust speaker identification. The results will be compared to the cepstrum feature due to its widespread use and success in spea...

On the use of visual information for improving audio-based speaker recognition

Audio-based speaker identification degrades severely when there is a mismatch between training and test conditions either due to channel or noise. In this paper, we explore various techniques to fuse video-based speaker identification with audio-based speaker identification to improve the performance under mismatch conditions.

Audio-visual speaker recognition for video broadcast news: some fusion techniques

Audio-based speaker identification degrades severely when there is a mismatch between training and test conditions either due to channel or noise. In this paper, we explore various techniques to fuse video-based speaker identification with audio-based speaker identification to improve the performance under mismatched conditions. Specifically, we explore techniques to optimally determine the relativ...

Publication date: 2001